Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 2278 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 356.1 KiB |
| Average record size in memory | 160.1 B |
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 15 |
| Boolean | 2 |
Warnings
| Warning | Type |
|---|---|
| Churn has constant value "0" | Constant |
| State has a high cardinality: 51 distinct values | High cardinality |
| Total day minutes is highly correlated with Total day charge | High correlation |
| Total day charge is highly correlated with Total day minutes | High correlation |
| Total eve minutes is highly correlated with Total eve charge | High correlation |
| Total eve charge is highly correlated with Total eve minutes | High correlation |
| Total night minutes is highly correlated with Total night charge | High correlation |
| Total night charge is highly correlated with Total night minutes | High correlation |
| Total intl minutes is highly correlated with Total intl charge | High correlation |
| Total intl charge is highly correlated with Total intl minutes | High correlation |
| Area code is highly correlated with Churn | High correlation |
| Churn is highly correlated with Area code and 3 other fields | High correlation |
| International plan is highly correlated with Churn | High correlation |
| State is highly correlated with Churn | High correlation |
| Voice mail plan is highly correlated with Churn | High correlation |
| Number vmail messages has 1610 (70.7%) zeros | Zeros |
| Customer service calls has 476 (20.9%) zeros | Zeros |
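The minutes/charge warnings are what one would expect if each charge were simply a fixed per-minute rate applied to the matching minutes column. The sketch below checks that idea on the day minutes and day charge values of the first five sample rows shown in this report. The per-minute rate is estimated from those rows; it is an assumption inferred from the sample, not something the report states.

```python
# (Total day minutes, Total day charge) pairs copied from the report's first rows.
day_pairs = [
    (265.1, 45.07),
    (161.6, 27.47),
    (243.4, 41.38),
    (299.4, 50.90),
    (166.7, 28.34),
]

# Assumption: charge = rate * minutes, with the rate estimated from the sample.
rates = [charge / minutes for minutes, charge in day_pairs]
estimated_rate = sum(rates) / len(rates)

# Largest gap between the reported charge and the charge rebuilt from minutes.
max_error = max(
    abs(round(minutes * estimated_rate, 2) - charge)
    for minutes, charge in day_pairs
)

print(f"estimated rate per minute: {estimated_rate:.4f}")
print(f"largest reconstruction error: {max_error:.2f}")
```

If the reconstruction error stays at the rounding level for the whole column, one of the two variables in each pair carries no extra information and can usually be dropped before modelling.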
Reproduction
| Analysis started | 2021-04-10 21:21:00.680124 |
|---|---|
| Analysis finished | 2021-04-10 21:21:25.427181 |
| Duration | 24.75 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
State
Categorical
| Distinct | 51 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 KiB |
| WV | 81 |
|---|---|
| VA | 63 |
| AL | 59 |
| WY | 58 |
| MN | 57 |
| Other values (46) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 4556 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | KS |
|---|---|
| 2nd row | OH |
| 3rd row | NJ |
| 4th row | OH |
| 5th row | OK |
| Value | Count | Frequency (%) |
| WV | 81 | 3.6% |
| VA | 63 | 2.8% |
| AL | 59 | 2.6% |
| WY | 58 | 2.5% |
| MN | 57 | 2.5% |
| WI | 57 | 2.5% |
| NY | 56 | 2.5% |
| OH | 56 | 2.5% |
| OR | 55 | 2.4% |
| CO | 52 | 2.3% |
| Other values (41) | 1684 |
| Value | Count | Frequency (%) |
| wv | 81 | 3.6% |
| va | 63 | 2.8% |
| al | 59 | 2.6% |
| wy | 58 | 2.5% |
| wi | 57 | 2.5% |
| mn | 57 | 2.5% |
| ny | 56 | 2.5% |
| oh | 56 | 2.5% |
| or | 55 | 2.4% |
| ut | 52 | 2.3% |
| Other values (41) | 1684 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 483 | 10.6% |
| A | 477 | 10.5% |
| M | 396 | 8.7% |
| I | 364 | 8.0% |
| T | 269 | 5.9% |
| D | 263 | 5.8% |
| O | 254 | 5.6% |
| C | 244 | 5.4% |
| V | 243 | 5.3% |
| W | 234 | 5.1% |
| Other values (14) | 1329 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4556 |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 483 | 10.6% |
| A | 477 | 10.5% |
| M | 396 | 8.7% |
| I | 364 | 8.0% |
| T | 269 | 5.9% |
| D | 263 | 5.8% |
| O | 254 | 5.6% |
| C | 244 | 5.4% |
| V | 243 | 5.3% |
| W | 234 | 5.1% |
| Other values (14) | 1329 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4556 |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 483 | 10.6% |
| A | 477 | 10.5% |
| M | 396 | 8.7% |
| I | 364 | 8.0% |
| T | 269 | 5.9% |
| D | 263 | 5.8% |
| O | 254 | 5.6% |
| C | 244 | 5.4% |
| V | 243 | 5.3% |
| W | 234 | 5.1% |
| Other values (14) | 1329 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4556 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 483 | 10.6% |
| A | 477 | 10.5% |
| M | 396 | 8.7% |
| I | 364 | 8.0% |
| T | 269 | 5.9% |
| D | 263 | 5.8% |
| O | 254 | 5.6% |
| C | 244 | 5.4% |
| V | 243 | 5.3% |
| W | 234 | 5.1% |
| Other values (14) | 1329 |
Account length
Real number (ℝ≥0)
| Distinct | 202 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.3309921 |
|---|---|
| Minimum | 1 |
| Maximum | 243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 73 |
| median | 100 |
| Q3 | 127 |
| 95-th percentile | 165.15 |
| Maximum | 243 |
| Range | 242 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 39.45893632 |
|---|---|
| Coefficient of variation (CV) | 0.3932876123 |
| Kurtosis | -0.1581933155 |
| Mean | 100.3309921 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.07601251352 |
| Sum | 228554 |
| Variance | 1557.007656 |
| Monotonicity | Not monotonic |
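Several of the figures reported for a numeric variable are derived from others: the range is the maximum minus the minimum, the IQR is Q3 minus Q1, the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The following sketch re-derives them from the Account length values copied from this report. The variable names are illustrative only.

```python
import math

# Values copied from the Account length tables in this report.
n_observations = 2278
minimum, maximum = 1, 243
q1, q3 = 73, 127
total = 228554
reported_mean = 100.3309921
reported_std = 39.45893632
reported_variance = 1557.007656
reported_cv = 0.3932876123

# Re-derive the dependent statistics from the primary ones.
derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "mean": total / n_observations,
    "cv": reported_std / reported_mean,
    "variance": reported_std ** 2,
}

# Compare with the reported figures, allowing for rounding in the report.
checks = {
    "range": derived["range"] == 242,
    "iqr": derived["iqr"] == 54,
    "mean": math.isclose(derived["mean"], reported_mean, rel_tol=1e-6),
    "cv": math.isclose(derived["cv"], reported_cv, rel_tol=1e-6),
    "variance": math.isclose(derived["variance"], reported_variance, rel_tol=1e-6),
}

for name, ok in checks.items():
    print(f"{name}: derived={derived[name]!r} consistent={ok}")
```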
| Value | Count | Frequency (%) |
| 87 | 31 | 1.4% |
| 101 | 28 | 1.2% |
| 93 | 28 | 1.2% |
| 105 | 27 | 1.2% |
| 90 | 27 | 1.2% |
| 86 | 27 | 1.2% |
| 99 | 27 | 1.2% |
| 100 | 26 | 1.1% |
| 106 | 25 | 1.1% |
| 78 | 25 | 1.1% |
| Other values (192) | 2007 |
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 3 | 4 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 243 | 1 | |
| 225 | 1 | |
| 224 | 1 | |
| 221 | 1 | |
| 217 | 1 |
Area code
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 KiB |
| 415 | |
|---|---|
| 510 | |
| 408 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6834 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 415 |
|---|---|
| 2nd row | 415 |
| 3rd row | 415 |
| 4th row | 408 |
| 5th row | 415 |
| Value | Count | Frequency (%) |
| 415 | 1123 | |
| 510 | 580 | |
| 408 | 575 |
| Value | Count | Frequency (%) |
| 415 | 1123 | |
| 510 | 580 | |
| 408 | 575 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1703 | |
| 5 | 1703 | |
| 4 | 1698 | |
| 0 | 1155 | |
| 8 | 575 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6834 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 1703 | |
| 5 | 1703 | |
| 4 | 1698 | |
| 0 | 1155 | |
| 8 | 575 | 8.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6834 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 1703 | |
| 5 | 1703 | |
| 4 | 1698 | |
| 0 | 1155 | |
| 8 | 575 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6834 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 1703 | |
| 5 | 1703 | |
| 4 | 1698 | |
| 0 | 1155 | |
| 8 | 575 | 8.4% |
International plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 KiB |
| False | |
|---|---|
| True | 152 |
| Value | Count | Frequency (%) |
| False | 2126 | |
| True | 152 | 6.7% |
Voice mail plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 1610 | |
| True | 668 |
Number vmail messages
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.507462687 |
|---|---|
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 1610 |
| Zeros (%) | 70.7% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 21 |
| 95-th percentile | 37 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 13.83016017 |
|---|---|
| Coefficient of variation (CV) | 1.625650406 |
| Kurtosis | -0.2688480892 |
| Mean | 8.507462687 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.178368517 |
| Sum | 19380 |
| Variance | 191.2733303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1610 | |
| 31 | 46 | 2.0% |
| 28 | 38 | 1.7% |
| 24 | 36 | 1.6% |
| 30 | 33 | 1.4% |
| 29 | 33 | 1.4% |
| 25 | 33 | 1.4% |
| 27 | 31 | 1.4% |
| 33 | 31 | 1.4% |
| 23 | 30 | 1.3% |
| Other values (32) | 357 | 15.7% |
| Value | Count | Frequency (%) |
| 0 | 1610 | |
| 4 | 1 | < 0.1% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 50 | 2 | 0.1% |
| 47 | 3 | |
| 46 | 3 | |
| 45 | 3 | |
| 44 | 5 |
Total day minutes
Real number (ℝ≥0)
| Distinct | 1324 |
|---|---|
| Distinct (%) | 58.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 175.1043459 |
|---|---|
| Minimum | 0 |
| Maximum | 313.8 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 89.7 |
| Q1 | 142.5 |
| median | 177.9 |
| Q3 | 209.8 |
| 95-th percentile | 253.76 |
| Maximum | 313.8 |
| Range | 313.8 |
| Interquartile range (IQR) | 67.3 |
Descriptive statistics
| Standard deviation | 50.10533422 |
|---|---|
| Coefficient of variation (CV) | 0.2861455777 |
| Kurtosis | 0.03134419361 |
| Mean | 175.1043459 |
| Median Absolute Deviation (MAD) | 33.8 |
| Skewness | -0.2481924827 |
| Sum | 398887.7 |
| Variance | 2510.544518 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 183.4 | 7 | 0.3% |
| 185 | 6 | 0.3% |
| 175.4 | 6 | 0.3% |
| 159.5 | 6 | 0.3% |
| 194.8 | 6 | 0.3% |
| 194.4 | 5 | 0.2% |
| 162.3 | 5 | 0.2% |
| 206.2 | 5 | 0.2% |
| 124.3 | 5 | 0.2% |
| 178.7 | 5 | 0.2% |
| Other values (1314) | 2222 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2.6 | 1 | |
| 7.8 | 1 | |
| 7.9 | 1 | |
| 12.5 | 1 |
| Value | Count | Frequency (%) |
| 313.8 | 1 | |
| 309.9 | 1 | |
| 308 | 1 | |
| 307.1 | 1 | |
| 305.2 | 1 |
Total day calls
Real number (ℝ≥0)
| Distinct | 114 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.1593503 |
|---|---|
| Minimum | 0 |
| Maximum | 160 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 100 |
| Q3 | 113 |
| 95-th percentile | 133 |
| Maximum | 160 |
| Range | 160 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.6819145 |
|---|---|
| Coefficient of variation (CV) | 0.1965060121 |
| Kurtosis | 0.1403284446 |
| Mean | 100.1593503 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.06015272982 |
| Sum | 228163 |
| Variance | 387.3777584 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 107 | 54 | 2.4% |
| 105 | 54 | 2.4% |
| 102 | 51 | 2.2% |
| 88 | 50 | 2.2% |
| 100 | 49 | 2.2% |
| 98 | 49 | 2.2% |
| 104 | 49 | 2.2% |
| 112 | 48 | 2.1% |
| 95 | 48 | 2.1% |
| 108 | 47 | 2.1% |
| Other values (104) | 1779 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 36 | 1 | |
| 40 | 1 | |
| 42 | 1 | |
| 44 | 1 |
| Value | Count | Frequency (%) |
| 160 | 1 | < 0.1% |
| 158 | 3 | |
| 157 | 1 | < 0.1% |
| 152 | 1 | < 0.1% |
| 151 | 3 |
Total day charge
Real number (ℝ≥0)
| Distinct | 1324 |
|---|---|
| Distinct (%) | 58.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.76826602 |
|---|---|
| Minimum | 0 |
| Maximum | 53.35 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15.25 |
| Q1 | 24.23 |
| median | 30.24 |
| Q3 | 35.67 |
| 95-th percentile | 43.1405 |
| Maximum | 53.35 |
| Range | 53.35 |
| Interquartile range (IQR) | 11.44 |
Descriptive statistics
| Standard deviation | 8.517839 |
|---|---|
| Coefficient of variation (CV) | 0.2861382317 |
| Kurtosis | 0.0314560443 |
| Mean | 29.76826602 |
| Median Absolute Deviation (MAD) | 5.75 |
| Skewness | -0.2481977403 |
| Sum | 67812.11 |
| Variance | 72.55358123 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31.18 | 7 | 0.3% |
| 27.12 | 6 | 0.3% |
| 31.45 | 6 | 0.3% |
| 33.12 | 6 | 0.3% |
| 29.82 | 6 | 0.3% |
| 36.65 | 5 | 0.2% |
| 28.99 | 5 | 0.2% |
| 35.17 | 5 | 0.2% |
| 36.72 | 5 | 0.2% |
| 35.05 | 5 | 0.2% |
| Other values (1314) | 2222 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.44 | 1 | |
| 1.33 | 1 | |
| 1.34 | 1 | |
| 2.13 | 1 |
| Value | Count | Frequency (%) |
| 53.35 | 1 | |
| 52.68 | 1 | |
| 52.36 | 1 | |
| 52.21 | 1 | |
| 51.88 | 1 |
Total eve minutes
Real number (ℝ≥0)
| Distinct | 1325 |
|---|---|
| Distinct (%) | 58.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 198.8533802 |
|---|---|
| Minimum | 0 |
| Maximum | 354.2 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 117.77 |
| Q1 | 163.625 |
| median | 199.55 |
| Q3 | 233.475 |
| 95-th percentile | 283.3 |
| Maximum | 354.2 |
| Range | 354.2 |
| Interquartile range (IQR) | 69.85 |
Descriptive statistics
| Standard deviation | 50.81895405 |
|---|---|
| Coefficient of variation (CV) | 0.2555599206 |
| Kurtosis | -0.01851011919 |
| Mean | 198.8533802 |
| Median Absolute Deviation (MAD) | 34.9 |
| Skewness | -0.02183366134 |
| Sum | 452988 |
| Variance | 2582.566091 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 167.2 | 7 | 0.3% |
| 161.7 | 7 | 0.3% |
| 195.5 | 6 | 0.3% |
| 219.1 | 6 | 0.3% |
| 205.1 | 6 | 0.3% |
| 169.9 | 6 | 0.3% |
| 220.6 | 6 | 0.3% |
| 230.1 | 5 | 0.2% |
| 241.4 | 5 | 0.2% |
| 203.8 | 5 | 0.2% |
| Other values (1315) | 2219 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 31.2 | 1 | |
| 42.2 | 1 | |
| 42.5 | 1 | |
| 43.9 | 1 |
| Value | Count | Frequency (%) |
| 354.2 | 1 | |
| 348.5 | 1 | |
| 341.3 | 1 | |
| 337.1 | 1 | |
| 335.7 | 1 |
Total eve calls
Real number (ℝ≥0)
| Distinct | 119 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.0364355 |
|---|---|
| Minimum | 0 |
| Maximum | 170 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 87 |
| median | 100 |
| Q3 | 114 |
| 95-th percentile | 134 |
| Maximum | 170 |
| Range | 170 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 20.25879961 |
|---|---|
| Coefficient of variation (CV) | 0.2025142091 |
| Kurtosis | 0.2571965773 |
| Mean | 100.0364355 |
| Median Absolute Deviation (MAD) | 13.5 |
| Skewness | -0.06574156795 |
| Sum | 227883 |
| Variance | 410.4189617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 55 | 2.4% |
| 109 | 51 | 2.2% |
| 94 | 50 | 2.2% |
| 115 | 47 | 2.1% |
| 97 | 47 | 2.1% |
| 87 | 46 | 2.0% |
| 108 | 46 | 2.0% |
| 98 | 45 | 2.0% |
| 99 | 45 | 2.0% |
| 95 | 44 | 1.9% |
| Other values (109) | 1802 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 12 | 1 | |
| 36 | 1 | |
| 42 | 1 | |
| 43 | 1 |
| Value | Count | Frequency (%) |
| 170 | 1 | |
| 157 | 1 | |
| 156 | 1 | |
| 155 | 2 | |
| 154 | 2 |
Total eve charge
Real number (ℝ≥0)
| Distinct | 1205 |
|---|---|
| Distinct (%) | 52.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.90280948 |
|---|---|
| Minimum | 0 |
| Maximum | 30.11 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.0085 |
| Q1 | 13.91 |
| median | 16.965 |
| Q3 | 19.8475 |
| 95-th percentile | 24.08 |
| Maximum | 30.11 |
| Range | 30.11 |
| Interquartile range (IQR) | 5.9375 |
Descriptive statistics
| Standard deviation | 4.31961408 |
|---|---|
| Coefficient of variation (CV) | 0.2555559822 |
| Kurtosis | -0.01860933171 |
| Mean | 16.90280948 |
| Median Absolute Deviation (MAD) | 2.97 |
| Skewness | -0.02179073796 |
| Sum | 38504.6 |
| Variance | 18.6590658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.25 | 9 | 0.4% |
| 17.43 | 8 | 0.4% |
| 16.12 | 8 | 0.4% |
| 17.99 | 8 | 0.4% |
| 18.62 | 8 | 0.4% |
| 13.74 | 7 | 0.3% |
| 18.96 | 7 | 0.3% |
| 18.16 | 7 | 0.3% |
| 14.21 | 7 | 0.3% |
| 12.95 | 6 | 0.3% |
| Other values (1195) | 2203 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2.65 | 1 | |
| 3.59 | 1 | |
| 3.61 | 1 | |
| 3.73 | 1 |
| Value | Count | Frequency (%) |
| 30.11 | 1 | |
| 29.62 | 1 | |
| 29.01 | 1 | |
| 28.65 | 1 | |
| 28.53 | 1 |
Total night minutes
Real number (ℝ≥0)
| Distinct | 1333 |
|---|---|
| Distinct (%) | 58.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.4640913 |
|---|---|
| Minimum | 43.7 |
| Maximum | 395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 43.7 |
|---|---|
| 5-th percentile | 114.5 |
| Q1 | 165.825 |
| median | 200 |
| Q3 | 235.675 |
| 95-th percentile | 284.505 |
| Maximum | 395 |
| Range | 351.3 |
| Interquartile range (IQR) | 69.85 |
Descriptive statistics
| Standard deviation | 51.28449606 |
|---|---|
| Coefficient of variation (CV) | 0.2558288406 |
| Kurtosis | 0.0751126226 |
| Mean | 200.4640913 |
| Median Absolute Deviation (MAD) | 34.9 |
| Skewness | 0.03020924338 |
| Sum | 456657.2 |
| Variance | 2630.099536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 214.6 | 6 | 0.3% |
| 194.3 | 6 | 0.3% |
| 214.7 | 6 | 0.3% |
| 193.6 | 5 | 0.2% |
| 214 | 5 | 0.2% |
| 190.5 | 5 | 0.2% |
| 180.6 | 5 | 0.2% |
| 192.7 | 5 | 0.2% |
| 109.6 | 5 | 0.2% |
| 172.7 | 5 | 0.2% |
| Other values (1323) | 2225 |
| Value | Count | Frequency (%) |
| 43.7 | 1 | |
| 45 | 1 | |
| 50.1 | 2 | |
| 53.3 | 1 | |
| 54 | 1 |
| Value | Count | Frequency (%) |
| 395 | 1 | |
| 381.9 | 1 | |
| 377.5 | 1 | |
| 364.9 | 1 | |
| 364.3 | 1 |
Total night calls
Real number (ℝ≥0)
| Distinct | 117 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.0079017 |
|---|---|
| Minimum | 33 |
| Maximum | 166 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 68 |
| Q1 | 87 |
| median | 100 |
| Q3 | 113 |
| 95-th percentile | 131 |
| Maximum | 166 |
| Range | 133 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.30728173 |
|---|---|
| Coefficient of variation (CV) | 0.1930575625 |
| Kurtosis | 0.004822730703 |
| Mean | 100.0079017 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.002130693971 |
| Sum | 227818 |
| Variance | 372.7711277 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 64 | 2.8% |
| 104 | 58 | 2.5% |
| 91 | 54 | 2.4% |
| 96 | 50 | 2.2% |
| 92 | 50 | 2.2% |
| 102 | 49 | 2.2% |
| 100 | 49 | 2.2% |
| 108 | 48 | 2.1% |
| 106 | 47 | 2.1% |
| 98 | 45 | 2.0% |
| Other values (107) | 1764 |
| Value | Count | Frequency (%) |
| 33 | 1 | |
| 36 | 1 | |
| 38 | 1 | |
| 42 | 1 | |
| 44 | 1 |
| Value | Count | Frequency (%) |
| 166 | 1 | |
| 164 | 1 | |
| 157 | 2 | |
| 156 | 2 | |
| 155 | 1 |
Total night charge
Real number (ℝ≥0)
| Distinct | 851 |
|---|---|
| Distinct (%) | 37.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.020974539 |
|---|---|
| Minimum | 1.97 |
| Maximum | 17.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 1.97 |
|---|---|
| 5-th percentile | 5.15 |
| Q1 | 7.4625 |
| median | 9 |
| Q3 | 10.6075 |
| 95-th percentile | 12.8045 |
| Maximum | 17.77 |
| Range | 15.8 |
| Interquartile range (IQR) | 3.145 |
Descriptive statistics
| Standard deviation | 2.307779214 |
|---|---|
| Coefficient of variation (CV) | 0.2558237144 |
| Kurtosis | 0.07475506147 |
| Mean | 9.020974539 |
| Median Absolute Deviation (MAD) | 1.57 |
| Skewness | 0.03019732788 |
| Sum | 20549.78 |
| Variance | 5.3258449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.66 | 12 | 0.5% |
| 8.57 | 9 | 0.4% |
| 9.14 | 9 | 0.4% |
| 9.32 | 9 | 0.4% |
| 7.15 | 9 | 0.4% |
| 10.49 | 9 | 0.4% |
| 8.47 | 9 | 0.4% |
| 9.63 | 9 | 0.4% |
| 10.35 | 9 | 0.4% |
| 6.48 | 8 | 0.4% |
| Other values (841) | 2186 |
| Value | Count | Frequency (%) |
| 1.97 | 1 | |
| 2.03 | 1 | |
| 2.25 | 2 | |
| 2.4 | 1 | |
| 2.43 | 1 |
| Value | Count | Frequency (%) |
| 17.77 | 1 | |
| 17.19 | 1 | |
| 16.99 | 1 | |
| 16.42 | 1 | |
| 16.39 | 1 |
Total intl minutes
Real number (ℝ≥0)
| Distinct | 156 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.13784021 |
|---|---|
| Minimum | 0 |
| Maximum | 18.9 |
| Zeros | 15 |
| Zeros (%) | 0.7% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.7 |
| Q1 | 8.4 |
| median | 10.2 |
| Q3 | 12 |
| 95-th percentile | 14.6 |
| Maximum | 18.9 |
| Range | 18.9 |
| Interquartile range (IQR) | 3.6 |
Descriptive statistics
| Standard deviation | 2.779621729 |
|---|---|
| Coefficient of variation (CV) | 0.274182831 |
| Kurtosis | 0.7158939471 |
| Mean | 10.13784021 |
| Median Absolute Deviation (MAD) | 1.8 |
| Skewness | -0.2700086021 |
| Sum | 23094 |
| Variance | 7.726296958 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 47 | 2.1% |
| 10.2 | 43 | 1.9% |
| 9.8 | 41 | 1.8% |
| 10.9 | 38 | 1.7% |
| 9.1 | 38 | 1.7% |
| 10.6 | 38 | 1.7% |
| 11.4 | 37 | 1.6% |
| 9.5 | 37 | 1.6% |
| 11.2 | 37 | 1.6% |
| 9.9 | 36 | 1.6% |
| Other values (146) | 1886 |
| Value | Count | Frequency (%) |
| 0 | 15 | |
| 1.1 | 1 | < 0.1% |
| 1.3 | 1 | < 0.1% |
| 2.1 | 1 | < 0.1% |
| 2.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18.9 | 1 | |
| 18.4 | 1 | |
| 18.2 | 2 | |
| 18 | 2 | |
| 17.8 | 2 |
Total intl calls
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.538191396 |
|---|---|
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 15 |
| Zeros (%) | 0.7% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.85 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.447532646 |
|---|---|
| Coefficient of variation (CV) | 0.539318956 |
| Kurtosis | 2.817344834 |
| Mean | 4.538191396 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.275467336 |
| Sum | 10338 |
| Variance | 5.990416051 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 460 | |
| 4 | 438 | |
| 5 | 334 | |
| 2 | 302 | |
| 6 | 232 | |
| 7 | 151 | 6.6% |
| 1 | 99 | 4.3% |
| 8 | 82 | 3.6% |
| 9 | 73 | 3.2% |
| 10 | 34 | 1.5% |
| Other values (10) | 73 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 15 | 0.7% |
| 1 | 99 | 4.3% |
| 2 | 302 | |
| 3 | 460 | |
| 4 | 438 |
| Value | Count | Frequency (%) |
| 19 | 1 | |
| 18 | 2 | |
| 17 | 1 | |
| 16 | 2 | |
| 15 | 2 |
Total intl charge
Real number (ℝ≥0)
| Distinct | 156 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.737708516 |
|---|---|
| Minimum | 0 |
| Maximum | 5.1 |
| Zeros | 15 |
| Zeros (%) | 0.7% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.54 |
| Q1 | 2.27 |
| median | 2.75 |
| Q3 | 3.24 |
| 95-th percentile | 3.94 |
| Maximum | 5.1 |
| Range | 5.1 |
| Interquartile range (IQR) | 0.97 |
Descriptive statistics
| Standard deviation | 0.7504414272 |
|---|---|
| Coefficient of variation (CV) | 0.2741129754 |
| Kurtosis | 0.7173874129 |
| Mean | 2.737708516 |
| Median Absolute Deviation (MAD) | 0.48 |
| Skewness | -0.2701449247 |
| Sum | 6236.5 |
| Variance | 0.5631623357 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.7 | 47 | 2.1% |
| 2.75 | 43 | 1.9% |
| 2.65 | 41 | 1.8% |
| 2.94 | 38 | 1.7% |
| 2.86 | 38 | 1.7% |
| 2.46 | 38 | 1.7% |
| 3.02 | 37 | 1.6% |
| 2.57 | 37 | 1.6% |
| 3.08 | 37 | 1.6% |
| 2.67 | 36 | 1.6% |
| Other values (146) | 1886 |
| Value | Count | Frequency (%) |
| 0 | 15 | |
| 0.3 | 1 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| 0.59 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.1 | 1 | |
| 4.97 | 1 | |
| 4.91 | 2 | |
| 4.86 | 2 | |
| 4.81 | 2 |
Customer service calls
Real number (ℝ≥0)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.453028973 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 476 |
| Zeros (%) | 20.9% |
| Memory size | 17.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.152125473 |
|---|---|
| Coefficient of variation (CV) | 0.7929129387 |
| Kurtosis | 0.9668612586 |
| Mean | 1.453028973 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.8347844656 |
| Sum | 3310 |
| Variance | 1.327393105 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 846 | |
| 2 | 546 | |
| 0 | 476 | |
| 3 | 311 | 13.7% |
| 4 | 69 | 3.0% |
| 5 | 20 | 0.9% |
| 6 | 7 | 0.3% |
| 7 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 476 | |
| 1 | 846 | |
| 2 | 546 | |
| 3 | 311 | 13.7% |
| 4 | 69 | 3.0% |
| Value | Count | Frequency (%) |
| 7 | 3 | 0.1% |
| 6 | 7 | 0.3% |
| 5 | 20 | 0.9% |
| 4 | 69 | 3.0% |
| 3 | 311 |
Churn
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2278 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 2278 |
| Value | Count | Frequency (%) |
| 0 | 2278 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2278 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2278 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 2278 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2278 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 2278 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2278 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 2278 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
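As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations using only the standard library. The function name and the sample values are made up for illustration, and this is not necessarily how the report itself computes r.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the standard deviations cancel out,
    # so plain sums of products and squares are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical sample values, not taken from the report.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8]

r_original = pearson_r(xs, ys)
# Shifting and rescaling each variable by a positive factor leaves r unchanged.
r_rescaled = pearson_r([10 * x + 3 for x in xs], [0.5 * y - 7 for y in ys])

print(r_original, r_rescaled)
```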
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
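A small sketch of the rank-based calculation, under the assumption that tied values receive the average of their ranks (a common convention, though not one stated in this report). The helper names and the sample data are illustrative.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Hypothetical data: y grows monotonically but not linearly with x.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]

rho = spearman_rho(xs, ys)
r = pearson_r(xs, ys)
print(rho, r)
```

On this monotonic but nonlinear example the rank-based coefficient reaches its maximum while Pearson's r stays below 1, which is the difference the paragraph describes.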
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
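The pair-counting definition can be sketched directly. Note that this is the simple variant usually called tau-a, which matches the description here; statistical libraries often default to a tie-adjusted variant (tau-b), so results may differ when the data contain ties. The function name and the data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # A pair is concordant when both variables move in the same direction.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # Tied pairs count toward the total but toward neither group.
    return (concordant - discordant) / len(pairs)


# Hypothetical data: one swapped neighbour out of four observations.
xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]

tau = kendall_tau_a(xs, ys)
print(tau)
```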
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proven to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | KS | 128 | 415 | No | Yes | 25 | 265.1 | 110 | 45.07 | 197.4 | 99 | 16.78 | 244.7 | 91 | 11.01 | 10.0 | 3 | 2.70 | 1 | 0 |
| 1 | OH | 107 | 415 | No | Yes | 26 | 161.6 | 123 | 27.47 | 195.5 | 103 | 16.62 | 254.4 | 103 | 11.45 | 13.7 | 3 | 3.70 | 1 | 0 |
| 2 | NJ | 137 | 415 | No | No | 0 | 243.4 | 114 | 41.38 | 121.2 | 110 | 10.30 | 162.6 | 104 | 7.32 | 12.2 | 5 | 3.29 | 0 | 0 |
| 3 | OH | 84 | 408 | Yes | No | 0 | 299.4 | 71 | 50.90 | 61.9 | 88 | 5.26 | 196.9 | 89 | 8.86 | 6.6 | 7 | 1.78 | 2 | 0 |
| 4 | OK | 75 | 415 | Yes | No | 0 | 166.7 | 113 | 28.34 | 148.3 | 122 | 12.61 | 186.9 | 121 | 8.41 | 10.1 | 3 | 2.73 | 3 | 0 |
| 5 | AL | 118 | 510 | Yes | No | 0 | 223.4 | 98 | 37.98 | 220.6 | 101 | 18.75 | 203.9 | 118 | 9.18 | 6.3 | 6 | 1.70 | 0 | 0 |
| 6 | MA | 121 | 510 | No | Yes | 24 | 218.2 | 88 | 37.09 | 348.5 | 108 | 29.62 | 212.6 | 118 | 9.57 | 7.5 | 7 | 2.03 | 3 | 0 |
| 7 | MO | 147 | 415 | Yes | No | 0 | 157.0 | 79 | 26.69 | 103.1 | 94 | 8.76 | 211.8 | 96 | 9.53 | 7.1 | 6 | 1.92 | 0 | 0 |
| 8 | WV | 141 | 415 | Yes | Yes | 37 | 258.6 | 84 | 43.96 | 222.0 | 111 | 18.87 | 326.4 | 97 | 14.69 | 11.2 | 5 | 3.02 | 0 | 0 |
| 9 | RI | 74 | 415 | No | No | 0 | 187.7 | 127 | 31.91 | 163.4 | 148 | 13.89 | 196.0 | 94 | 8.82 | 9.1 | 5 | 2.46 | 0 | 0 |
Last rows
| | State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2268 | SD | 163 | 415 | Yes | No | 0 | 197.2 | 90 | 33.52 | 188.5 | 113 | 16.02 | 211.1 | 94 | 9.50 | 7.8 | 8 | 2.11 | 1 | 0 |
| 2269 | OK | 52 | 415 | No | No | 0 | 124.9 | 131 | 21.23 | 300.5 | 118 | 25.54 | 192.5 | 106 | 8.66 | 11.6 | 4 | 3.13 | 2 | 0 |
| 2270 | WY | 89 | 415 | No | No | 0 | 115.4 | 99 | 19.62 | 209.9 | 115 | 17.84 | 280.9 | 112 | 12.64 | 15.9 | 6 | 4.29 | 3 | 0 |
| 2271 | OH | 78 | 408 | No | No | 0 | 193.4 | 99 | 32.88 | 116.9 | 88 | 9.94 | 243.3 | 109 | 10.95 | 9.3 | 4 | 2.51 | 2 | 0 |
| 2272 | OH | 96 | 415 | No | No | 0 | 106.6 | 128 | 18.12 | 284.8 | 87 | 24.21 | 178.9 | 92 | 8.05 | 14.9 | 7 | 4.02 | 1 | 0 |
| 2273 | SC | 79 | 415 | No | No | 0 | 134.7 | 98 | 22.90 | 189.7 | 68 | 16.12 | 221.4 | 128 | 9.96 | 11.8 | 5 | 3.19 | 2 | 0 |
| 2274 | AZ | 192 | 415 | No | Yes | 36 | 156.2 | 77 | 26.55 | 215.5 | 126 | 18.32 | 279.1 | 83 | 12.56 | 9.9 | 6 | 2.67 | 2 | 0 |
| 2275 | WV | 68 | 415 | No | No | 0 | 231.1 | 57 | 39.29 | 153.4 | 55 | 13.04 | 191.3 | 123 | 8.61 | 9.6 | 4 | 2.59 | 3 | 0 |
| 2276 | RI | 28 | 510 | No | No | 0 | 180.8 | 109 | 30.74 | 288.8 | 58 | 24.55 | 191.9 | 91 | 8.64 | 14.1 | 6 | 3.81 | 2 | 0 |
| 2277 | TN | 74 | 415 | No | Yes | 25 | 234.4 | 113 | 39.85 | 265.9 | 82 | 22.60 | 241.4 | 77 | 10.86 | 13.7 | 4 | 3.70 | 0 | 0 |